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DETAILED ACTION 

1 . This action is in response to the application filed on 3/16/2004, and IDS filed on 
07/20/2004. 

2. Claims 1-42 are pending. Claims 1 , 32, 38, 39, 40, 41 , 42 are Independent 
claims. 

Claim Rejections • 35 USC § 101 

35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufectuire, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

3. Claims 39 and 40 are rejected under 35 U.S.C. 101 because the claimed 
invention is directed to non-statutory subject matter. 

With regards to claim 39, for the claimed "data packet" is similar in properties to a 
carrier wave. Since carrier waves, being a form of electromagnetic energy, do not fall 
into one of the statutory categories of 35 U.S.C. 101, the claim includes non-statutory 
subject matter. A detailed explanation describing why carrier waves are regarded as 
non-statutory subject matter under 35 U.S.C. 101 follows: 

Claims that recite nothing but the physical characteristics of a form of energy, such 
as a frequency, voltage, or the strength of a magnetic field, define energy or magnetism, 
per se, and as such are nonstatutory natural phenomena. O'Reilly, 56 U.S. (15 How.) at 
112-14. Moreover, it does not appear that a claim reciting a signal encoded with 
functional descriptive material falls within any of the categories of patentable subject 
matter set forth in § 101 . 
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First, a claimed signal is clearly not a "process" under § 101 because it is not a 

series of steps. The other three § 101 classes of machine, compositions of matter and 

manufactures "relate to structural entities and can be grouped as 'product' claims in 

order to contrast them with process claims." 1 D. Chisum, Patents § 1 .02 (1994). The 

three product classes have traditionally required physical structure or material. 

I* 

"The term machine includes every mechanical device or combination of mechanical 
device or combination of mechanical powers and devices to perform some function and 
produce a certain effect or result." Corning v. Burden, 56 U.S. (15 How.) 252, 267 
(1854). A modern definition of machine would no doubt include electronic devices which 
perform functions. Indeed, devices such as flip-flops and computers are referred to in 
computer science as sequential machines. A claimed signal has no physical structure, 
does not itself perform any useful, concrete and tangible result and, thus, does not fit 
within the definition of a machine. 

A "composition of matter" "covers all compositions of two or more substances and 
includes all composite articles, whether they be results of chemical union, or of 
mechanical mixture, or whether they be gases, fluids, powders or solids." Shell 
Development Co. v. Watson, 149 F, Supp. 279, 280, 113 USPQ 265, 266 (D.D.C. 
1957), affd, 252 F.2d 861, 116 USPQ 428 (D.C. Cir. 1958). A claimed signal is not 
matter, but a form of energy, and therefore is not a composition of matter. 

The Supreme Court has read the term "manufacture" in accordance with its 
dictionary definition to mean "the production of articles for use from raw or prepared 
materials by giving to these materials new forms, qualities, properties, or combinations, 
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whether by hand-labor or by machinery." Diamond v. Chakrabarty, 447 U.S. 303, 308, 
206 USPQ 193, 196-97 (1980) (quoting American Fruit Growers, Inc. v. Brogdex Co., 
283 U.S. 1,11,8 USPQ 131, 133 (1931), which, in turn, quotes the Century Dictionary). 
Other courts have applied similar definitions. See American Disappearing Bed Co. v. 
Arnaelsteen, 182 F. 324, 325 (9th Cir. 1910), cert, denied, 220 U.S. 622 (1911). These 
definitions require physical substance, which a claimed signal does not have. Congress 
can be presumed to be aware of an administrative or judicial interpretation of a statute 
and to adopt that interpretation when it re-enacts a statute without change. Lorillard v. 
Pons, 434 U.S. 575, 580 (1978). Thus, Congress must be presumed to have been 
aware of the interpretation of manufacture In American Fruit Growers when it passed 
the 1952 Patent Act. 

A manufacture is also defined as the residual class of product. 1 Chisum, § 1.02[3] (citing 
W. Robinson, The Law of Patents for Useful Inventions 270 (1890)). 

A product is a tangible physical article or object, some form of matter, which a 
signal is not. That the other two product classes, machine and composition of 
matter, require physical matter is evidence that a manufacture was also intended 
to require physical matter. A signal, a form of energy, does not fall within either of 
the two definitions of manufacture. Thus, a signal does not fall within one of the 
four statutory classes of § 101 . 
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With regards to claim 40, for the claimed "system" appears to be claiming "Non- 
Functional Descriptive Material". The applicant provides in the specification details 
concerning the term "component" as intended to refer to a computer-related entity, 
either hardware, a combination of hardware and software, software, or software in 
execution. Although the component can be executed, there is no indication in the claim 
that component is being executed, only that the referred system of claim 1 "comprises" 
a component, and thus does not present any functional interrelationship. See MPEP 
2106 below: 

Nonfunctional Descriptive Material 

Descriptive material that cannot exhibit any functional interrelationship with the 
way in which computing processes are performed does not constitute a statutory 
process, machine, manufacture or composition of matter and should be rejected under 
35 U.S.C. 101. Thus, Office personnel should consider the claimed invention as a whole 
to determine whether the necessary functional interrelationship Is provided. 

Where certain types of descriptive material, such as music, literature, art, 
photographs and mere arrangements or compilations of facts or data, are merely stored 
so as to be read or outputted by a computer without creating any functional 
interrelationship, either as part of the stored data or as part of the computing processes 
perfomned by the computer, then such descriptive material alone does not impart 
functionality either to the data as so structured, or to the computer. Such "descriptive 
material" is not a process, machine, manufacture or composition of matter. (Data 
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consists of facts, which become information when they are seen in context and convey 
meaning to people. Computers process data without any understanding of what that 
data represents. Computer Dictionary 210 (Microsoft Press, 2d ed. 1994).) 



Claim Rejections - 35 USC §112 

The following is a quotation of the first paragraph of 35 U.S.C. 112: 

The specification shall contain a written description of the invention, and of the manner and process of 
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the 
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall 
set forth the best mode contemplated by the Inventor of carrying out his invention. 

4. Claims 3, 4, and 40 are rejected under 35 U.S.C. 112, first paragraph, as failing 
to comply with the enablement requirement. The claim(s) contains subject matter which 
was not described in the specification in such a way as to enable one skilled in the art to 
which it pertains, or with which it is most nearly connected, to make and/or use the 
invention. 

With regards to claim 3, although the applicant provides in the specification the 
details for using a lexicon, the applicant fails to provide implementation details for using 
a claimed lexicon ^without content; since a lexicon ^without content is not described 
anywhere in the specification. For the purposes of examination, the Examiner will 
assume a lexicon in general (with content) without regards the lexicon 'without content 
that the Applicant is claiming. 

With regards to claim 4, although the applicant provides in the specification the 
details using stop words, the applicant fails to provide implementation details for using a 
stop list 'without content; as a stop list 'without content is not described anywhere in 
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the specification. For the purposes of examination, the Examiner will assume a stop list 
in general (with content) without regards to the stop list 'without content' that the 
Applicant is claiming. 

With regards to claim 40, the applicant provides in the specification details 
concerning the term "component" as intended to refer to a computer-related entity, 
either hardware, a combination of hardware and software, software, or software in 
execution. Yet, it is claimed "A computer readable medium having stored thereon 
computer executable components of the system of claim 1", and since 'components' 
could be hardware, it is not feasible to claim hardware being stored on a computer 
readable medium. 

The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

5. Claims 3, 41 , 42 are rejected under 35 U.S.C. 1 1 2, second paragraph, as being 
indefinite for failing to particularly point out and distinctly claim the subject matter which 
applicant regards as the invention. 

Claim 3 recites the limitation "the group" in line 3 of claim 3. There is insufficient 
antecedent basis for this limitation In the claim. 

Claim 41 recites the limitation "the group" in line 2 of claim 41. There is 
insufficient antecedent basis for this limitation in the claim. 

Claim 42 recites the limitation "the group" in line 2 of claim 42. There is 
insufficient antecedent basis for this limitation in the claim. 
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Claim Rejections - 35 USC § 103 
The following is a quotation of 35 U.S.C. 103(a) which fomis the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not t>e obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary sl^ill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

6. Claims 1-4, 6-14. 18, 20-24, 32-35, 38, 39, and 40-42 are rejected under 35 

U.S.C. 103(a) as being unpatentable over Tang et al (US Patent: 6,636,849 B1, issued: 

Oct. 21 , 2003, filed: Nov. 22, 2000) in further view of Shanahan et al (US Application: 

US 2003/0033288 Al, published: Feb. 13, 2003, filed: Dec. 5, 2001) and Beeferman et 

al (US Patent: 6,701,309 81, issued: Mar. 2. 2004, filed: Apr. 21. 2000). 

With regards to claim 1 , Tang et al teaches a system that facilitates spell 

checking comprising: 

• A component that receives input data containing text (column 4, lines 55-66: 
whereas a search string is received) 

• A spell checking component that identifies potentially misspelled strings in the 
text, and pmposes at least one alternate spelling for the string (column 7. 
lines 20-30: whereas, the Tang et al's system teaches spell checking 
potentially misspelled words using a dictionary/lexicon, and returning a 
suggestion to the user concerning a least one alternate spelling.) 

However, Tang et al does not teach creating/using substrings of the text, and 
providing an alternate spelling for the substring set, based on at least one query log; the 
query log comprising data utilized by users to query a data collection over a time frame. 
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Shanahan et al teaches a spell checking system (paragraph 0518: whereas, 
Shanahan et al's system takes text, and identifies text that need spelling corrections). 
The speir checking system takes substring data from the input text (Fig 31: whereas, 
text from a document is processed by tokenizing words, and identifying N-Gram of 
words from the input text after removal of stop words). All words (substrings of the input 
text) are iteratively are processed and corrected to generate a set of alternate spellings 
for the input text as shown in Fig. 51 . 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al's spell checking component to further take and 
process and provide alternate spellings for substrings of input text as taught by 
Shanahan et al. The combination of Tang et al and Shanahan et al would have allowed 
Tang et al's system to have "identified errors in a document, by formulating a query 
using identified errors in document content, identifying a set of entities in the database 
of entities that satisfies the query; correcting the document content using the identified 
set of entities, and updating the information space with the corrected document content" 
(paragraph 0015). 

However, Tang et al and Shanahan et al do not teach the alternate spelling of a 
substring set is based on at least one query log; the query log comprising data utilized 
by users to query a data collection over a time frame. 

Beeferman et al teaches an alternate spelling of a string query (column 1 , lines 
51-54: whereas, a system for query refinement includes suggesting an alternate spelling 
or a corrected spelling for a query) is based on data stored in a query log file (columns 
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10 and 11, lines 52-64 and 1-10 respectively: whereas, based on heuristic data from 
query log data, it is determined if a suggested spelling is appropriate), the query log 
comprising data utilized by users to query a data collection over a time frame (Table 2, 
column 9, lines 45-67, and column 10, lines 1-6: whereas a query log holds data about 
the number of occurrences for each particular query/string has been submitted by a 
particular class of users in searches, over a period of time). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al, and Shanahan et al's spell checking component 
such that spell checks using substrings are additionally based off of heuristic query log 
data, as taught by Beeferman et al. The combination of Tang et al, Shanahan et al, and 
Beeferman et al would have allowed Tang et al's system to have been able to have 
implemented a spell checking system that would have "refined a presentation of an 
alternative query to a first query based on a searcher's tendency to utilize information" 
(column 2, lines 27-30), and to have also " collected related queries that have a 
likelihood of being submitted by a class of searcher" (column 2, lines 24-26). 

With regards to claim 2, which depends on claim 1 , for a spell checking 
component further utilizes user-dependent information in proposing at least one 
alternative spelling, is similarly taught by Tang et al, Shanahan et al, and Beeferman et 
al, in claim 1, and is rejected under the same rationale. 

With regards to claim 3, which depends on claim 1, Tang et al teaches the 
alternative spelling for the substring set is further based on at least one trusted lexicon 
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with content (column 7, lines 23-29: whereas, a dictionary which comprises the correct 
spelling of words, is used as a basis for providing an alternative spelling). 

With regards to claim 4, which depends on claim 3, Tang et al teaches the spell 
checking component, in claim 1 , and is rejected under the same rationale. However 
Tang et al does not teach the spell checking component further employs a list of stop 
words. 

Shanahan et al teaches a list of stop words with content (paragraph 0365: 
whereas, a set/list of stop words are used to normalize input text data for contextual 
classification). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al's spell checker such that the input text data will be 
normalize by removing stop words as taught by Shanahan et al. The combination would 
have allowed Tang et al's spell checker to have been able to remove stop words "that 
do not improve the quality of classification" (paragraph 0365). 

With regards to claim 6, which depends on claim 4, Tang et al teaches a spell 
checking component, in claim 1 , and is rejected under the same rationale. Furthermore, 
Tang et al teaches an Iterative process to search a space of alternative spellings (Fig 6, 
column 12, lines 1-30: whereas, processing for exact or inexact matches are performed 
on a search tree start, and iterations or a loops take place until the last level of a search 
tree is reached (looping occurs from reference numbers 630 to 670 and then back to 
630 in Fig 6). 
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With regards to claim 7, whicli depends on claim 6, Tang et al teaclies a spell 
checking component, in claim 1 , and is rejected under the same rationale. Furthermore, 
Tang et al teaches at least in part, tieuiistics to impose restrictions on a search space 
utilized to determine a proposed alternative spelling (column 15, lines 60-67, and 
column 16, lines 1-5: whereas heuristic methods are used to impose restrictions on a 
search space by calculating a distance score that is used in detemriining candidates for 
alternative spellings)). 

With regards to claim 8, which depends on claim 7, Tang et al teaches the 
heuristics in claim 7, and is rejected under the same rationale. Furthennore, Tang et al 
teaches the heuristics utilize, at least in part, at least one fringe to limit the search space 
(column 9, lines 59-67, and column 10, lines 1-5: whereas, several fringes are 
implemented to limit the search space, such as the probabilistic distance function 
having to be positive, and a triangle inequality has to be satisfied). 

With regards to claim 9, which depends on claim 4, Tang et al, Shanahan et al, 
and Beefemian et al similarly teach the query log comprising a histogram of queries 
asked over a time frame, as explained in claim 1, and is rejected under the same 
rationale. 

With regards to claim 10, which depends on claim 9, Tang et al, Shanahan et al, 
and Beefemian et al teach the histogram of queries, as explained in claim 9, and is 
rejected under the same rationale. Tang et al, Shanahan et al, and Beeferman et al also 
teach the histogram of queries relates to a subset/class of the users, as explained in 
claim 1 , and is rejected under the same rationale. Furthermore, the su6se//class 
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comprises at least one user (Beeferman et al, column 5, lines 7-9: whereas a particular 
class of searchers represents a subset of users with at least one searcher/user).- 

With regards to claim 1 1, which depends on claim 9, Tang et al, Shanahan et al, 
and Beeferman et al teach a query log, as explained in claim 1, and is rejected under 
the same rationale. Furthermore, Beeferman et al teaches the query log resides on a 
server computer {column 5, lines 39-40: whereas the query log is downloaded from a 
search engine/server computer to the client computer, and thus the query log originally 
resides on the server computer). 

With regards to claim 12, which depends on claim 9, Tang et al, Shanahan et al, 
and Beeeferman et al teach a query log, as explained in claim 1 , and is rejected under 
the same rationale. Furthemiore, as explained in claim 1 1 , the query log is downloaded 
from the server to the client, and thus the query log resides on the client computer as 
well. 

With regards to claim 1 3, which depends on claim 9, Tang et al, Shanahan et al, 
and Beeferman et al teach: the spell checking component utilizes substrings from at 
least one query log, as explained in claim 1 , and is rejected under the same rationale. 
Furthermore, Beeferman et al teaches the query log comprises occurrence and co- 
occurrence statistics in Table 2 (whereas, the query log comprises the number of 
occurrences for each particular query (query string) requested. Furthermore, the 
number of occurrences are shown to be greater than one, and thus, co-occurrence 
counts for each of the particular queries are also recorded). 
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It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al, Shanahan et al, and Beeferman et al's substring 
spell checking system to use occurrence and co-occurrence statistics on substring data 
from at least one query log as taught by Beeferman et al. The combination would have 
allowed Tang et al's system to have been able to spell check text using occurrence and 
co-occurrence statistics for determining an "ordered rank according to mutual 
information value" (column 9, 60-62). 

With regards to claim 14, which depends on claim 13, Tang et al, Shanahan et al, 
and Beeferman et al teaches a substring comprising at least one selected from the 
group consisting of an entry in at least one lexicon, as explained in claim 3, and is 
rejected under the same rationale. 

With regards to claim 18, which depends on claim 13, Tang et al, Shanahan et al, 
and Beeferman et al teach substring co-occunence statistics from the query log, as 
explained in claim 13, and is rejected under the same rationale. Furthermore, the query 
information is stored in a single data structure/log by downloading from a server as 
explained in claim 1 1 , and is rejected under the same rationale. 

With regards to claim 20, which depends on claim 18, Tang et al and Shanahan 
et al teach a spell checking system handling split substrings by splitting input text into 
an N-Gram set of words, as explained in claim 1 , and is rejected under the same 
rationale. Furthermore, Shanahan et al teaches a method for using heuristics to 
determine word similarity, which does not differ/(operafes in the same mannei), if the 
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input text is an individual string or an N-Word split substring using the searching 
technique that was explained in claim 6 above. 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al, Shanahan et al's substring spell checking 
component, to further utilize the string (individual or substring) independent method for 
searching a search space, as also taught by Shanahan et al. The combination of Tang 
et al, Shanahan et al, and Beeferman et al would have allowed Tang et al's spell 
checking component to have been able to have provided for expanded search results if 
needed by searching split substrings. 

With regards to claim 21 , which depends on claim 20, Tang et al, Shanahan et al, 
and Beefemnan et al teach the spell checking component generates a set of alternative 
spellings'that are substrings in a at least one selected from the group consisting of at 
least one query log and at least one lexicon, as explained in claim 1 , and is rejected 
under the same rationale. 

With regards to claim 22, which depends on claim 21 , Tang et al, Shanahan et al, 
and Beeferman et al teach the set of alternative spellings comprising a set of alternative 
spellings, as explained in claim 1, and is rejected under the same rationale. 
Furthermore, Shanahan et al teaches the alternative spellings are determined via an 
iterative correction process (paragraph 051 1 : whereas, through an iterative correction 
process, text/string/substring in a document gets replaced/corrected with another 
substring as an alternative spelling. Furthermore, the iterative correction process halts 
when all the number of errors corrected at a previous iteration is less than a threshold 
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value, and thus the possible alternative spellings are less appropriate than the current 
set of alternative spellings). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al, Shanahan et al, and Beeferman et al's spell 
correction component, to have further implemented the generation of alternate spellings 
in an iterative correction process as also taught by Shanahan et al. The combination of 
Tang et al, Shanahan et al, and Beeferman et al would have allowed Tang et al's 
system to have repeatedly analyzed input text content until a satisfying correction level 
has been established. 

With regards to claim 23, which depends on claim 22, Tang et al, Shanahan et al, 
and Beeferman et al teach the iterative correction process, comprising a plurality of 
iterations that change at least on substring to another substring as an alternative 
spelling, the iterative correction process halts wheri all possible alternative spellings are 
less appropriate than a cun-ent set of alternative spellings, as explained in claim 22, and 
is rejected under the same rationale. 

With regards to claim 24, which depends on claim 23, Tang et al teaches 
alternative spellings, in claim 1 , and Is rejected under the same rationale. Tang et al 
also teaches the appropriateness of alternative spellings are computed based on a 
probabilistic string distance, as explained in claim 7, and is rejected under the same 
rationale. Tang et al however, does not teach the appropriateness of alternative 
spellings are computed based on a statistical context model. 
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Shanahan et al teaches the appropriateness of alternative spellings are 
computed based on a statistical context model (paragraph 0243: whereas the context of 
the words surrounding a substring/entity is taken into account, and using ranking 
methods, only the highest ranked results are kept as appropriate for an alternative 
spelling). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al's spell correction system for providing alternative 
spellings, to further include the ability to determine appropriateness for alternative 
spellings based on not only string distance, but through context - statistical analysis as 
well. The combination of Tang et al, Shanahan et al, and Beeferman et al would have 
allowed Tang et al's system to have improved the accuracy of alternative spellings by 
taking the context of the input text into account when providing alternative results. 

With regards to claim 32, Tang et al, Shanahan et al, and Beefemnan et al similarly 
teach a method comprising: 

• Receiving input data containing text, as explained in claim 1 , and is rejected 
under the same rationale. 

• Identifying a set of potentially misspelled substrings in the text, as explained in 
claim 1 , and is rejected under the same rationale. 

• Proposing at least one alternative spelling for the substring set based on at least 
one query log; the query log comprising data utilized by users to query a data 
collection overtime, as explained in claim 1, and is rejected under the same 
rationale. 
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With regards to claim 33, Tang et al, Shanahan et al, and Beeferman et al similarly 
teach a method comprising: 

• The alternative spelling for the substring set is further based on a at least one 
trusted lexicon; the trusted lexicon comprising at least one trusted lexicon with 
content, as explained in claim 3, and is rejected under the same rationale. 

With regards to claim 34, which depends on claim 33, Tang et al, Shanahan et al, 
and Beeferman et al similarly teach a method comprising: 

• Employing, at least in part, a list of stop words to facilitate in determining at least 
one alternative in spelling; the list of stop words comprising at least one selected 
from the group consisting of a list of stop words with content, as explained in 
claim 4, and is rejected under the same rationale. 

• Utilizing substring occurrence and co-occurrence statistics from at least one 
query log, as explained in claim 13, and is rejected under the same rationale. 

• The query log comprising a histogram of queries asked over a time frame, as 
explained in claim 9, and is rejected under the same rationale. 

• The substring occurrence and co-occurrence statistics from the query log are 
stored in a same searchable data structure, as explained in claim 18, and is 
rejected under the same rationale. 

• Handling split substrings in the same manner as handling individual substrings, 
as explained in claim 20, and is rejected under the same rationale. 
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• Generating a set of alternative spellings that are substrings in at least one 
selected from the group consisting of at least one query log and at least one 
lexicon, as explained in claim 21, and is rejected under the same rationale. 

• The set of alternative spellings comprising a set of altemative spellings 
determined via an iterative correction process, as explained in claim 22, and is 
rejected under the same rationale. 

With regards to claim 35, which depends on claim 34, Tang et al, Shanahan et al, 
and Beeferman et al similarly teach method comprising: 

• Changing at least one substring to another substring as an altemative spelling, 
as explained in claim 23, and is rejected under the same rationale. 

• Halting the iterative conrection process when all possible altemative spellings are 
less appropriate than a current set of altemative spellings, as explained in claim 

23, and is rejected under the same rationale. 

• The altemative spellings and their appropriateness are computed based on a 
probabilistic string distance and a statistical context model, as explained in claim 

24, and is rejected under the same rationale. 

With regards to claim 38, Tang et al, Shanahan et al, and Beeferman et al similarly 
teach a system comprising: 

• Means for receiving input data containing text, as described in claim 1 , and is 
rejected under the same rationale. 

• Means for identifying a set of potentially misspelled substrings in the text and 
proposing at least one altemative spelling for the substring set based on at least 
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one query log; the query log comprising data utilized by users to query a data 
collection over a time frame, as described in claim 1, and is rejected under the 

t 

same rationale. 

With regards to claim 39, Tang et al, Shanahan et al, and Beeferman et al teach a 
spell checking system that provides, at least in part, at least one alternative spelling for 
a string set based, at least in part, on at least one query log, as explained in claim 1 and 
is rejected under the same rationale. Furthemnore, since Beeferman et al's system 
comprises network communication for transmitting query log data between server and 
computer, as explained in claim 1 1 , and also it is inherent that since the query log was 
sent through Internet communication methods, the communication inherently comprise 
packets of data (Beeferman et al, column 3, line 9) and thus a data packet is transmitted 
between two or more computer components as well to facilitate spell checking. 

With regards to claim 40, Tang et al, Shanahan et al, and Beefemrian et al similarly 
teach a computer readable medium having stored thereon, computer executable 
components of the system of claim 1, as similarly explained in claim 1, and is rejected 
under the same rationale. 

With regards to claim 41 , Tang et al, Shanahan et al, and Beefennan et al similarly 
teach a device employing the method of claim 32, comprising at least one selected from 
the group consisting of a computer, a server, and a handheld device, as explained in 
claim 32, and is rejected under the same rationale. 

With regards to claim 42, Tang et al, Shanahan et al, and Beefennan et al similarly 
teach a device employing the system of claim 1 , comprising at least one selected from 
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the group consisting of a computer, as server, and a handheld electronic device, as 
explained in claim 1, and is rejected under the same rationale. 
7. Claims 5 is rejected under 35 U.S.C. 103(a) as being unpatentable over Tang et 
al (US Patent: 6,636,849 B1, issued: Oct. 21, 2003, filed: Nov. 22. 2000), US 
Application: US 2003/0033288 Al, published: Feb. 13. 2003, filed: Dec. 5, 2001). and 
Beeferman et al (US Patent: 6,701,309 81, issued: Mar. 2, 2004, filed: Apr. 21, 2000) in 
further view of de Hita et al (US Patent: 6,081,774, issued: Jun. 27, 2000, filed: Aug. 22, 
1997). 

With regards to claim 5, Tang et al and Shanahan et al, teach a list of stop 
words, as explained in claim 4, and is rejected under the same rationale. However. 
Tang et al and Shanahan et al do not teach the list of stop words coritaining high 
frequency words and function words and their frequent misspellings. 

Hita et al teaches a list of stop/fekip words containing high frequency words, 
function words, and their frequent misspellings: whereas a stop/skip list is implemented 
for high frequency words and function words (column 1, lines 43-44), and frequent 
misspellings (column 2, lines 10-12). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al, and Shanahan et al's list of stop words to further 
include high frequency words, function words, and their frequent misspellings as taught 
by Hita et al. The combination of Tang et al, Shanahan et al, Beeferman et al, and Hita 
et al would have allowed Tang et al's spell checking component to have been able to 
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normalize an input data/string set to focus on words that provide more semantic 
content. 

8. Claim 1 5 is rejected under 35 U.S.C. 103(a) as being unpatentable over Tang et 
al (US Patent: 6,636,849 B1. issued: Oct. 21, 2003, filed: Nov. 22, 2000), US 
Application: US 2003/0033288 Al, published: Feb. 13, 2003, filed: Dec. 5, 2001), and 
Beeferman et al (US Patent: 6,701,309 81, issued: Mar. 2, 2004, filed: Apr. 21, 2000) in 
further view of Hitachi (Denwent, published: Feb 16, 2001 , Abstract). 

With regards to claim 15, which depends on claim 13, Tang et al, Shanahan et al, 
and Beefemnan et al teach substring co-occurrence statistics, as explained in claim 13, 
and is rejected under the same rationale. However, Tang et al, Shanahan et al, and 
Beeferman et al do not teach the statistics comprising substring bigram counts; a 
substring bigram comprising a pair of substrings in a text. 

Hitachi teaches the statistics comprising substring bigram counts; a substring 
bigram comprising a pair of substrings in a text (Abstract: whereas, a collecting unit 
collects substrings / bigram strings from a document, and a counter counts the 
occurrence(s) for the pair of bi-grams). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al, Shanahan et al, and Beeferman et al's substring 
co-occurrence statistics such that they also include bigram counts from a pair of 
substrings in a text/document, as taught by Hitachi. The combination of Tang et al, 
Shanahan et al, Beeferman et al, and Hitachi, would have allowed Tang et al's spell 
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checking component to have been able to evaluate each pair of bigrams "in order of 
degree of importance" (Hitachi, Abstract). 

9. Claim 16 is rejected under 35 U.S.C. 103(a) as being unpatentable over Tang et 
al (US Patent: 6,636,849 B1, issued: Oct. 21, 2003, filed: Nov. 22, 2000), US 
Application: US 2003/0033288 Al , published: Feb. 13, 2003, filed: Dec. 5, 2001), 
Beeferman et al (US Patent: 6,701,309 81, issued: Mar. 2, 2004, filed: Apr. 21, 2000) 
and Hitachi (Denwent, published: Feb 16, 2001, Abstract), in further view of Herz et al 
(US Patent: 5,754,939, issued: May 19, 1998, filed: Oct 31, 1995). 

With regards to claim 16, which depends on claim 15, Tang et al, Shanahan et al, 
and Beeferman et al, and Hitachi teach the substring bigram comprising a pair of 
substrings in text, as explained in claim 15, and is rejected under the same rationale. 
However, Tang et al, Shanahan et al, Beeferman et al, and Hitachi do not expressly 
teach that the bigrams are adjacent substrings in a text. 

Herz et al teaches the bigrams are adjacent substrings in a text (column 13, lines 
28-30: whereas, text is broken into bigrams, which are 2 adjacent words). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al, Shanahan et al, Beeferman et al, and Hitachi's 
spell checking component to further include the ability to process substring bigrams that 
are adjacent in a text, as taught by Herz et al. The combination of Tang et al, Shanahan 
et al, and Beeferman et al, Hitachi et al, and Herz et al would have allowed Tang et al's 
spell checking component to have been able to process bigrams that are contextually 
close to each other. 
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10. Claims 19. 26, 27, 28, 29, 30, and 36 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Tang et al (US Patent: 6,636,849 B1, issued: Oct. 21, 2003, 
filed: Nov. 22, 2000). US Application: US 2003/0033288 Al, published: Feb. 13, 2003, 
filed: Dec. 5, 2001), and Beefemian et al (US Patent: 6,701,309 B1, issued: Mar. 2, 
2004, filed: Apr. 21, 2000) in further view of Srihari et al (ACM, published: January 
1983, pages 72-75). 

With regards to claim 19, which depends on claim 18, Tang et al teaches a tree 
data structure extracted from a lexicon (column 7, lines 21-23). However, Tang et al 
does not teach a data structure comprising a trie. 

Srihari et al teaches a data structure comprising a trie (Section 3, P3-1, Figure 2: 
whereas, a data structure used to represent a lexicon is a trie). 

It would have been obvious to one of the ordinary skill In the art at the time of the 
invention to have modified Tang et al's method for representing a lexicon in the form of 
a trie, as taught by Srihari et al. The combination of Tang et al, Shanahan et al, 
Beeferman et al, and Srihari et al, would have allowed Tang et al's spell checking 
component to have implemented a "data structure that is suitable for determining 
whether a given string is an initial substring" (Srihari et al, Section 3, P3-2). 

With regards to claim 26. which depends on claim 24, Tang et al, Shanahan et al, 
and Beeferman et al teaches a set of alternative spellings for a substring is generated, 
in as explained in claim 1 , and is rejected under the same rationale. Tang et al also 
teaches a searchable string data structure extracted from a trusted lexicon (column 7, 
lines 22-24: whereas, a structured tree for a whole dictionary/lexicon is created for 
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searching). Furthermore, Beeferman teaches a searchable query log data structure 
(Table 2, whereas, a flat data structure is used to store occurrence and co-occurrence 
query data). However, Tang et al, Shanahan et al, and Beeferman et al do not expressly 
teach a searchable substring data structure. 

Srihari et al teaches a searchable substring data structure (Page 72-73, Section 
3. Lexical Organization, Fig. 2: whereas, a trie is extracted from a lexicon, for which the 
trie is used to implement a searchable substring data structure using the Viterbi 
algorithm). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al, Shanahan et al, and Beeferman et al's spell 
correction system such that trie data structures storing substrings are extracted from a 
particular source (such as a lexicon or query log), such that the alternative spellings are 
generated from a trie using a Viterbi algorithm, as taught by Srihari et al. The 
combination of Tang et al, Shanahan et al, Beeferman et al, and Srihari et al would 
have allowed Tang et al's spell checking system to have used a data structure that is 
"efficient for text correction algorithms" (Srihari et al, page 72, Section 3). 

With regards to claim 27, which depends on claim 26, Tang et al and Shanahan 
et al teach the processing of substrings from input text, in claim 1 , and is rejected under 
the same rationale. Also Tang et al teaches the set of alternative strings for each string 
query is restricted to within a probabilistic distance from an input string (column 1 0, lines 
32-42: whereas, alternative spellings for a string are based on several factors, including 
the probabilistic distance); the restriction is imposed within each iteration without limiting 
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the iterative correction process as a whole (column 10, lines 48-60: whereas, the 
process is repeatedly extended to multiple search spaces or "grids"). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have used Tang et al, and Shanahan et al's method for processing 
substrings from input text, and additionally used Tang et al's method for iteratively 
processing the search space with a substring by applying probabilistic distance 
calculations. The combination of tang et al, Shanahan et al, Beeferman et al, and 
Srihari et al would have allowed Tang et al's system to increase the speed and 
relevancy of possible alternative spellings for a given input substring. 

With regards to claim 28, which depends on claim 27, Tang et al, Shanahan et al, 
and Beeferman et al teaches the iterative correction process, in claim 6, and is rejected 
under the same rationale. Furthermore, Shanahan et al teaches an iterative correction 
process searches for an optimum set of alternative spellings via utilization of a statistical 
context model: whereas the context of the words surrounding a substring/entity is taken 
into account, and using ranking methods, only the highest ranked results are kept as 
appropriate for an alternative spelling (paragraph 0243) and the iterative correction 
process halts when all the number of errors corrected at a previous iteration is less than 
a threshold value, and thus the possible alternative spellings are less appropriate than 
the current set of alternative spellings (paragraph 051 1 : iterative process stops when 
the number of errors corrected is less than a threshold (optimal value)). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have further modified Tang et al, Shanahan et al, and Beeferman et al's 
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iterative correction system to further include the ability to use a statistical context model 
to determine an optimal set of alternative spellings, as taught by Shanahan et al. The 
combination of Tang et al, Shanahan et al, Beeferman et al, and Srihari et al, would 
have allowed Tang et al's spell checking component to have been able to iteratively go 
though a search space, and choosing alternative spellings based on context of the input 
sentence/string/substring. 

With regards to claim 29, which depends on claim 28, Tang et al and Shanahan 
et al teach the statistical context model, as explained in claim 28, and is rejected under 
the same rationale. Furthermore, Tang et al, and Shanahan et al, Beeferman et al, and 
Srihari et al teach extracting substring data from a query log and a lexicon as explained 
in claim 26, and is rejected under the same rationale. However Tang et al and 
Shanahan et al do not teach the statistical context model comprising substring 
occunence and co-occurrence statistics extracted from at least one query log. 

Beeferman et al teaches a statistical context model comprising substring 
occurrence and co-occurrence statistics from at least one query log (Table 2: whereas, 
the statistics shown in the table show the ranking of occurrence and co-occurrence 
statistics in a query log. The statistics are based from the context of queries logged for a 
particular class of user(s) (column 9, lines 62-65)). 

It would have been obvious to one of the ordinary skill in the art at the time of the 
invention to have modified Tang et al, Shanahan et al's statistical context model such 
that it can implement query/string occurrence and co-occurrence statistics extracted 
from at least one contextually-sensitive query log as taught by Beeferman et al. The 
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combination of Tang et al, Slianahan et al, Beeferman et al, and Srihari et al would 
have allowed Tang et al's spell checking component to have been able to use 
history/log data that was contextually sensitive to a particular class of user/field of 
search such that more relevant results would have been produced/suggested. 

With regards to claim 30, which depends on claim 29, Tang et al, Shanahan et al, 
Beefennan et al, and Srihari et al teach: 

• A Vitertoi search is employed to facilitate in determining the optimum set of 
alternative spellings, as explained in claim 26, and is rejected under the same 
rationale. 

• Alternate spellings are determined according to the context model in each 
iteration, as explained in claim 28, and is rejected under the same rationale. 

With regards to claim 36, which depends on claim 35, Tang et al, Shanahan et al, 
Beeferman et al, and Srihari et al similarly teach a method comprising: 

• Utilizing a searchable data structure extracted from at least one query log and at 
least one trusted lexicon to generate the set of alternative spellings for a 
substring, as explained in claim 26, and is rejected under the same rationale. 

• Restricting the set of alternative spellings for each substring to within a 
probabilistic distance from an input substring, the restriction being imposed withn 
each iteration without limiting the iterative correction process as a whole, as 
explained in claim 27, and is rejected under the same rationale. 

• Searching for an optimum set of alternative spellings via utilization of a statistical 
context model, as explained in claim 28 and is rejected under the same rationale. 
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• The statistical context model comprising substring occurrence and co-occun-ence 
statistics extracted from at least one query log, as explained in claim 29, and Is 
rejected under the same rationale. 

Allowable Subject Matter 

1 1 . Claims 1 7, 25, 31 , and 37 are objected to as being dependent upon a rejected 
base claim, but would be allowable if rewritten in independent form including all of the 
limitations of the base claim and any intervening claims. 

Conclusion 

12. The prior art made of record and not relied upon is considered pertinent to 
, applicant's disclosure. 

• Golding (US Patent: 5,659,771, issued: Aug. 19, 1997, filed: May 19, 

1995) : This reference teaches spell checking based on contextual 
analysis. 

• Nielson (US Patent: 5,875,443, issued: Feb 23, 1999, filed: Jan. 30, 

1996) : This reference teaches internet/network based spell checking, 
whereas, the server houses a database of approved spellings. 

• Nielson (US Patent: 5,892,919, issued: Apr. 6, 1999, filed: Jun. 23, 1997): 
This reference teaches a simulation of a log at the server side, which 
stores commonly misspelled URLs. 

• Ortega et al (US Patent: 6,853,993 B2, issued: Feb 8, 2005. filed: Apr. 1 , 
2002): This reference teaches suggesting spelling corrections based on 
query log data. 
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• De Campos (US Patent: 6,272.456 B1 . issued: Aug 7. 2001 , filed: Mar. 1 9, 
1998): This reference teaclies using n-gram methods/data structures for 
language detection. 

• Kuklch (ACM, published December 1992, pages 377-439): This reference 
teaches known spell checking/correction algorithms in the industry, 
including Viterbi, and Damerau-Levenshtein. 
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